Statistical Analysis of Korean Pronunciation Variations
نویسنده
چکیده
In this paper, we present a statistical analysis of Korean pronunciation variations using a Grapheme-to-Phoneme (GTP) system. The GPT system generates pronunciation variants by applying rules modeling obligatory and optional phonemic changes and allophonic changes in spoken Korean. Experimental results using a PBS (Phonetically Balanced Sentence) Speech DB of 60,000 sentences show that the most frequently happening obligatory phonemic variations are in the order of liaison, tensification, aspirationalization, and nasalization of obstruent, and the most frequently happening optional phonemic variations are in the order of initial consonant /h/-deletion, insertion of final consonant with the same place of articulation as the next consonant’s, and deletion of final consonant with the same place of articulation as the next consonant’s. These statistics can be used for improving the performance of speech recognition systems.
منابع مشابه
A corpus-based analysis of Korean segments produced by Japanese learners
This paper examines variations of Korean segments produced by Japanese learners of Korean. For corpus-based statistical analysis, we have used Korean read speech corpus produced by Japanese learners. Contrastive analysis of the target language and the source language is performed to provide information for interpreting the results of corpus analysis. Segmental variations are analyzed by alignin...
متن کاملKorean Children's Spoken English Corpus and an Analysis of its Pronunciation Variability
This paper introduces a corpus of Korean-accented English speech produced by children (the Korean Children’s Spoken English Corpus: the KC-SEC), which is constructed by Seoul National University. The KC-SEC was developed in support of research and development of CALL systems for Korean learners of English, especially for elementary school learners. It consists of read-speech produced by 96 Kore...
متن کاملModeling Cross-morpheme Pro for Korean Large Vocabulary Cont
In this paper, we describe a cross-morpheme pronunciation variation model which is especially useful for constructing morpheme-based pronunciation lexicon for Korean LVCSR. There are a lot of pronunciation variations occurring at morpheme boundaries in continuous speech. Since phonemic context together with morphological category and morpheme boundary information affect Korean pronunciation var...
متن کاملPronunciation lexicon modeling and design for Korean large vocabulary continuous speech recognition
In this paper, we describe a pronunciation lexicon model which is especially useful for constructing morpheme-based pronunciation lexicon to improve the performance of a Korean LVCSR. There are a lot of pronunciation variations occurring at morpheme boundaries in continuous speech. For modeling of cross-morpheme pronunciation variations, we usually used a context-dependent multiple pronunciatio...
متن کاملAutomatic generation of Korean pronunciation variants by multistage applications of phonological rules
Phonetic transcriptions are often manually encoded in a pronunciation lexicon. This process is time consuming and requires linguistic expertise. Moreover, it is very difficult to maintain consistency. To handle these problems, we present a model that produces Korean pronunciation variants based on morphophonological analysis. By analyzing phonological variations frequently found in spoken Korea...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003